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Shortia brevistyla comb. et stat. nov., (Diapensiaceae), 
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L. L. Gaddy', T. H. Carter’, B. Ely’, S. Sakaguchi’, A. Matsuo” and Y. Suyama5 


'President, terra incognita and Adjunct Faculty, 
Department of Biology, University of South Carolina, Columbia, SC, USA 
Ph. D. Candidate, Genetics Laboratory, Department of Biology, University of South Carolina 
>Director, Genetics Laboratory, and Professor of Biology, University of South Carolina 
4Graduate School of Human and Environmental Studies, Kyoto University, Japan 
“Graduate School of Agricultural Sciences, Tohoku University, Miyagi, Japan 


Correspondence to Ilgaddy2@gmail.com. 
ABSTRACT 


Morphological, geographic, and molecular data justify recognition of Shortia brevistyla (Shortia 
galacifolia Torr. & A. Gray var. brevistyla P.A. Davies) as a distinct species rather than a variety of Shortia 
galacifolia. All known populations of the new Shortia brevistyla are found within a 10 km radius on the 
headwaters of the Catawba River in McDowell County, North Carolina, approximately 100 km northeast 
of the range of S. galacifolia. Shortia brevistyla has significantly smaller flowers, shorter styles, shorter 
petals, and smaller leaves than S. galacifolia. Genetic data indicate that the two species differ at five of 
210 nucleotide positions in the ITS] ribosomal region. Genetic divergence models indicate that the two 
species diverged approximately 20,000 years ago during the glacial maximum of the Pleistocene. Published 
on-line www.phytologia.org Phytologia 101(2): 113-119 (June 21, 2019). ISSN 030319430. 
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Shortia galacifolia Torr. & A. Gray, the only species of Shortia heretofore known from North 
America (Nesom, 2009), was described by Asa Gray from a specimen in Paris collected in 1839 by Andre 
Michaux from the “Hautes Montagnes de Carolinie” (Williams et al. 2004). Michaux had scribbled on the 
herbarium sheet that the plant may be a Pyrola or a new genus. After extensive searches for the plant in 
the Carolinas, it was found to occur at low elevations (ca. 300-600 meters) along the Blue Ridge front, not 
in “high” mountains. Two major locations of the plant have been found in the Carolinas, one in the 
Keowee-Toxaway drainage of the headwaters of the Savannah River in North and South Carolina, and 
another on the headwaters of the Catawba River in North Carolina. Interestingly, between the areas where 
the plant has been found, there is a gap of approximately 100 km where no Shortia plants have ever been 
seen. Davies (1952) named the Catawba drainage plants in North Carolina “var. brevistyla,” implying that 
the Keowee-Toxaway plants were “var. galacifolia’ and concluding (as later did Zahner and Jones 1983) 
that the type material for Shortia galacifolia (in Paris) was collected from plants in the Keowee -Toxaway 
drainage in South Carolina (Davies 1956). Morphological, geographic, and molecular data indicate that 
the Catawba shortia - heretofore, Shortia galacifolia (Torrey & Gray) var. brevistyla P.A. Davies - is 
justifiably treated at the specific rank. It is morphologically distinct and geographically separated from 
the Keowee-Toxaway populations of typical S. galacifolia, and the two taxa have diverged significantly in 
nucleotide (ITS1) sequences. 
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METHODS 


Single leaves were removed from individual Shortia plants sampled from both Catawba River and 
Keowee-Toxaway populations. Individual leaves were frozen with liquid nitrogen then ground using 
SPEX Sample Prep 2010 Grinder (Metuchen, NJ). DNA extractions of the leaves were performed with the 
Promega Wizard DNA Purification Kit (Promega Madison, WI) according to the manufacturer’s 
instructions. Primers complementary to highly conserved regions of the ribosomal ITS] region (Cheung 
2016) were ordered from Eurofins Genomics (Louisville, KY). The corresponding region of the Shortia 
genome was amplified using PCR with the following conditions after an initial denaturation step of 4 
minutes: denaturation at 94°C for 30 seconds, annealing at 55°C for 40 seconds, and elongation at 72°C for 
60 seconds. After a final elongation step of 10 minutes at 72°C, a sample of the resulting PCR product was 
subjected to agarose gel electrophoresis to verify amplification and then sent to Eurofins Genomics for 
Sanger sequencing. The resulting nucleotide sequences were analyzed utilizing UGENE software 
(Okonechnikov 2012) and NCBI Blast (https://blast.ncbi.nlm.nih.gov/Blast.cgi). 


Total DNA was extracted from dried leaf samples (ca. 0.5 cm’) using CTAB 
(cetyltrimethylammonium bromide) method (Murray and Thompson, 1980) in 1.2mL tubes, after washing 
the leaf powder for two times with 0.8 mL of a buffer, containing 10% polyethylene glycol, 0.35M solbitol, 
50mM Tris-HCl, 0.1% Bovine Serum Albumin and 0.1% B-melcaptoethanol. Genetic polymorphisms were 
detected with MIG-seq [multiplexed ISSR (inter-simple sequence repeats) genotyping by sequencing] 
technique, a PCR (polymerase chain reaction)-based procedure for constructing highly reduced 
representation libraries, involving de novo SNP (single nucleotide polymorphism) discovery, and their 
genotyping using high-throughput sequencing (Suyama and Matsuki, 2015). Briefly, 10 ng of DNA was 
used to amplify anonymous genomic regions using multiplexed PCR with tailed ISSR primers as the Ist 
PCR. 


Eight primers of the most recommended set of MIG-seq primers (set-1) (Suyama and Matsuki 
2015) were employed for the 1st PCR. PCR thermal profile was the same with the original protocol, except 
that the annealing temperature set to 38°C for satisfactory amplifications across samples. Subsequently, the 
Ist PCR products were used as templates for the 2nd PCR (tailed PCR). Using common forward and 
indexed reverse primers, this step permits the addition of complementary sequences for the binding sites of 
Ilumina sequencing flow cell and index for each sample to the Ist PCR products. Then, the 42 PCR 
products were purified, fragments in the size range of 350-800 bp were isolated, and their final 
concentrations were measured by quantitative PCR. Sequencing of the multiplexed library was performed 
with an Illumina MiSeq Sequencer, using MiSeq Reagent Kit v3 (150 cycle, Ilumina). 


After read trimming using Trimmomatic v. 0.32 (with the following commands LEADING: 19, 
TRAILING: 19, SLIDING WINDOW:30:20, AVGQUAL:20 and MINLEN:51) (Bolger et al., 2014), Stacks 
1.08 (Catchen et al., 2011) was used to process the MIG-seq reads with the following parameter settings; 
the minimum number of identical reads required to create a stack (m = 3), the nucleotide mismatches 
between loci within a single individual (M = 2), and the mismatches between loci when building the 
catalogue (n = 1). The SNP genotype for each individual was exported with a minimum read depth of 10, 
using the ‘populations’ command. The genotype data was then processed using PLINK v. 1.07 software 
(Purcell et al., 2007), and markers with a minor allele frequency < 0.03, missing individual rate > 0.9, 
missing locus rate > 0.9, and significant deviation from Hardy-Weinberg equilibrium (P < 0.01) were 
filtered out. 


Population genetic statistics (the number of polymorphic loci, number of alleles, observed and 
expected heterozygosity, and inbreeding coefficient) were summarized for each population using GenAlEx 
6.5 (Peakall and Smouse, 2012). The significance of Hardy—Weinberg equilibrium (heterozygote 
deficiency) were examined by exact test implemented in Genepop 4.2 (Rousett, 2008). Population 
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differentiation was assessed by a summary statistic of Fst and AMOVA (analysis of molecular variance) 
using GenAIEx 6.5, and visualized by PCA (principal coordinate analysis) using ‘adegenet’ package 
(Jombart, 2008) in R 3.1.0 (R Development Core Team 2014). 


Past population demography was modeled by applying a pure isolation model (Hey and Nielsen 
2007) to the MIG-seq sequence data. The population model assumed an ancestral population split into two 
descended populations (i.e. extant var. galacifolia and var. brevistyla) at a past time (A. These populations 
were allowed to have different parameters of effective population size (anc, Aeic, and Ary). In Bayesian 
parameter estimation using [Ma2 program (Hey and Nielsen, 2007), the maximum priors for each parameter 
was set as follow: t = 1, Banc = Aste = Oory = 3, and under this prior settings, 10 Metropolis-coupled MCMCs 
with a heating scheme of geometric model (with parameter specifications of -ha 0.99 and -hb 0.75) were 
generated for 3 million steps after a burn-in period of 1 million steps. Infinite-site model was used to model 
the sequence evolution of MIG-seq data. To confirm the adequate chain mixing and parameter convergence, 
independent two runs were performed. After obtaining the posterior parameters, demographic parameters 
were scaled to absolute divergence time in years and effective population size, by applying the geometric 
mean of 7.0 x 10° (substitutions/site/year) estimated for Arabidopsis species (Ossowski et al. 2010) and a 
generation time of 10 years for Shortia. 


DESCRIPTION OF SPECIES 
SHORTIA BREVISTYLA (P. A. Davies) Gaddy, comb. et stat. nov. (Figure 1). 


Shortia galacifolia var. brevistyla P.A. Davies, Basionym. Rhodora 54: 124. 1952. 
TYPE: U.S. A. North Carolina, McDowell Co., John’s Creek (Fish Hatchery Creek) above the 
Fish Hatchery, on moist creekbank under Rhododendron maximum, 23 March 1951, P. A. Davies 
9149 (holotype: GH!; topotype: USCH!). 


Differs from Shortia galacifolia Torrey & Gray in that its styles are shorter, its petals are shorter 
and shallowly toothed, and its leaves are smaller. 


Rhizomes: slender, scale-leaved, lignescent. Stems: erect, unbranched. Leaves: basal, rosulate 
from rhizome buds, 20-55 mm in length; petiole present; blade orbiculate to elliptic-orbiculate, ovate- 
oblong, or ovate, margins coarsely crenate-serrate, apex emarginate to truncate, surfaces glabrous, pinnately 
veined. Scapes: bracteate, elongating after flowering. Inflorescences: solitary. Flowers: sepals distinct; 
petals 12-17 mm long, corolla rotate to campanulate, lobes white to pink to faded rose-purple, margins 
slightly toothed; anthers 2-locular, without basal spurs, longitudinally dehiscent; filaments adnate to corolla 
tube; staminodes present. Styles 6-12 mm in length. 


Davies (1952) used style length as the primary character in his description of Shortia galacifolia 
var. brevistyla. Although the styles are significantly shorter than those of galacifolia, styles in S. brevistyla 
populations are often variable in length. And although petals and leaves are significantly shorter in S. 
brevistyla, the best field character, however, is the shallow, blunt teeth on the petals of S. brevistyla versus 
the deeply laciniate teeth on the petals of S. galacifolia (Figure 1; Table 1). 
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Figure 1. Typical flowers of brevistyla (left) (2.5x) and galacifolia (2x). 


Table 1. Morphology of Shortia galacifolia and Shortia brevistyla.' 


SPECIES FLOWER LEAF PETAL PETAL STYLE 
SHAPE LENGTH LENGTH DISTAL LENGTH 
MEAN MARGIN 


37-82 mm (54) | 16-25 mm 12-18 mm 


20-55 mm (41) 12-15mm_ | shallowly toothed 6-12 mm 
campanulate 


'Data from Davies (1952), Hatley (1977), Nesom (2009), and L. L. Gaddy (field observations: 2018-19). 


RESULTS AND DISCUSSION 


Two independent analyses of the chromosomal ribosomal ITS1 region were performed using DNA 
isolated from Shortia plants sampled from two different locations within each of the two geographic regions 
in North Carolina and South Carolina—the Catawba shortia region and the Keowee-Toxaway region— 
where North American Shortia is found. In both sample sets, we obtained 210 base pairs of nucleotide 
sequence for each sample that was 100% identical for all the North Carolina specimens as well as the 
Shortia ITS1 sequence present in the GenBank database (Table 2). Similarly, the nucleotide sequences of 
the South Carolina samples were identical, but differed from the North Carolina samples at five nucleotide 
positions. Therefore, we concluded that the Catawba River drainage North Carolina Shortia is genetically 
distinct from the Keowee-Toxaway (Savannah River) drainage South Carolina Shortia and that they have 
been reproductively isolated from one another long enough for ITS1 regions to diverge at 2% of the 
nucleotide positions. 


Data from the laboratories of Dr. Shota Sakaguchi revealed that after adapter and quality trimming, 
an average of 191.4k sequence reads were obtained for the individual samples. The reads were then 
assembled and filtered by Stacks 1.08 and PLINK v. 1.07, which resulted in a genotype matrix consisted of 
97 SNP markers. The genotyping rate at these markers was high: 97.1% for galacifolia and 99.0% for 
brevistyla. Significantly lower genetic diversity in brevistyla is revealed in multiple statistics (Table 2). 
Among the 92 markers, only 19.6% showed allelic variation in brevistyla, while 93.4% were polymorphic 
in galacifolia. The number of alleles (Na = 1.92 in galacifolia vs. 1.20 in brevistyla) and the heterozygosity 
index (e.g., He = 0.266 in galacifolia vs 0.072 in brevistyla) were consistently low in brevistyla. The 
inbreeding coefficient values were slightly negative in both species (they are both narrow endemics) but 
were not significantly deviated from Hardy-Weinberg equilibrium (P > 0.05 for both populations) (Figure 
2; Table 3). 
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Table 2. Divergent nucleotide positions in comparison of ITS1 regions in Keowee-Toxaway drainage (SC) 
and Catawba River drainage (NC) Shortia populations. 


Nucleotide Number! 


'Sequence number from AY049800, labeled “S. galacifolia’” ITS1 sequence in 
Genbank (as it turns out, the Genbank sequence is from S. brevistyla). 


Table 3. Summary genetic diversity statistics for galacifolia and brevistyla.' 


| Species | UN UT OPT Nj | CH S| Hes S| is 
galacifolia average 21.36 0.934 1.92 0.270 0.266 
(n=22) SE 0.09 -- 0.03 0.018 0.015 0.026 
brevistyla average 1.20 0.095 0.072 
(n=20) SE 0.04 -- i 0.024 0.017 0.034 

'N: number of genotyped individuals, P: proportion of polymorphic loci, Na: number of alleles, Ho: 

observed heterozygosity, He: expected heterozygosity, Fis: inbreeding coefficient. 


PCO 2 (6.9%) 


PCO 1 (35.0%) 


Figure 2. Plot of principal coordinate analysis scores of individual samples: |-galacifolia; 2-brevistyla. 
Numbers on the coordinate axes are the percent variance accounted for by that axis. 
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The independent two runs using IMa2 program gave highly consistent posterior distributions of 
demographic parameters, indicating the robustness of our Bayesian analyses. The estimated effective 
population size of was 5396 (2211, 9641) for galacifolia and 619 (265, 2388) for brevistyla, respectively 
[HiPt (95% low, high) values are shown as posterior distribution summary]. The larger population size 
estimated for the common ancestral population [20609 (15302, 28216)] showed the extant two “species” 
experienced demographic decline after lineage divergence. The divergence event was estimated to have 
occurred 20049 (8255, 53070) years before present, a timing roughly corresponding to the last glacial 
maximum of the Pleistocene. 


Like other species of Shortia (Gaddy and Nuraliev 2017), both North American species of the genus 
are narrow endemics with very small ranges and exacting habitats. All the known populations of S. 
brevistyla are all found within 10 km radius in the upper Catawba River drainage just north and northeast 
of Marion, North Carolina in McDowell County. The more numerous populations of Shortia galacifolia 
all fit into an area within a 15 km radius around its type locality in the Keowee-Toxaway drainage just north 
of the Lake Jocassee dam (in Oconee County, South Carolina). Finally, the southwestern most S. brevistyla 
population known is about 100 km northeast of the northeastern most S. galacifolia population. 


Shortia brevistyla appears to have speciated due to genetic drift in isolated populations of Shortia 
galacifolia with low heterozygosity and self-pollinating plants. Estimates, using Bayesian estimation in 
the IMa2 program, indicate that these two species diverged approximately 20,000 years ago during the time 
of the glacial maximum (Osssowski et al. 2010). 
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